I-prune: Item selection for associative classification

نویسندگان

  • Elena Baralis
  • Paolo Garza
چکیده

Associative classification is characterized by accurate models and high model generation time. Most time is spent in extracting and post-processing a large set of irrelevant rules, which are eventually pruned. We propose I-prune, an item pruning approach that selects uninteresting items by means of an interestingness measure and prunes them as soon as they are detected. Thus, the number of extracted rules is reduced and model generation time decreases correspondingly. A wide set of experiments on real and synthetic datasets has been performed to evaluate I-prune and to select the appropriate interestingness measure. The experimental results show that I-prune allows a significant reduction in model generation time, while increasing (or at worst preserving) model accuracy. Experimental evaluation also points to the chi-square measure as the most effective interestingness measure for item pruning.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating classification capability and reliability in associative classification: A beta-stronger model

Mining class association rules is an important task for associative classification and plays a key role in rule-based decision support systems. Most of the existing methods try the best to mine rules with high reliability but ignore their capability for classifying potential objects. This paper defines a concept of -stronger relationship, and proposes a new method that integrates classification...

متن کامل

Determining the effective features in classification of heart sounds using trained intelligent network and genetic algorithm

Heart diseases are among the most important causes of mortality in the world, especially in industrial countries. Using heart sounds and the features extracted from them are among the non-aggressive diagnosis and prognosis methods for heart diseases. In this study, the time-scale, Cepstral, frequency, temporal and turbulence features are saved and extracted from the heart sounds, and then they ...

متن کامل

A tree-projection-based algorithm for multi-label recurrent-item associative-classification rule generation

Associative-classification is a promising classification method based on association-rule mining. Significant amount of work has already been dedicated to the process of building a classifier based on association rules. However, relatively small amount of research has been performed in association-rule mining from multi-label data. In such data each example can belong, and thus should be classi...

متن کامل

Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...

متن کامل

Associative recognition in mild cognitive impairment: relationship to hippocampal volume and apolipoprotein E.

Associative memory involves remembering relations between items of information and is critically dependent on the hippocampus, a brain structure that shows early changes in amnestic mild cognitive impairment (aMCI) and Alzheimer's disease. We examined associative and item memory in aMCI with a focus on the role of medial-temporal lobe regions and genetic risk for Alzheimer's disease. Twenty-fou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Int. J. Intell. Syst.

دوره 27  شماره 

صفحات  -

تاریخ انتشار 2012